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foreign language experience and color word interference 



Cyrus R. Sisson 

Center for Research on Language and Language Behavior 
The University of Michigan 



If various color names are printed in various color inks an 
observer has great difficulty in rapidly naming the ink colors 
(Stroop Color Word Test) unless the color names and the ink colors 
are mutually reinforcing, or the color names are unknown to the 
observer. The latter suggests a partial measure of second-language 
fluency, the feasibility of which was examined by administering a 
bilingual version of the color word test to 4 college student 
groups whose familiarity with French differed. As might be expected, 
when the color names are French and the subject names the ink colors 
in English, greater familiarity with French is associated with 
greater interference in naming; however, differences between the 
groups in average naming time were not statistically significant. 

The Stroop color word test presents subjects with a conflict situation in 
which they must state aloud the color of the ink used to print a series of color 
words on a card. Interference arises from the fact that the word and the ink 
color both vary, but are always incongruent (^) . Performance time in this 
situation under conditions of set intended to elicit rapid, accurate performance 
is compared with the time it takes to perform two related tasks under the same 
set conditions, i.e., in one task, subjects read a similar series of color words 
printed in black (W) ; in the other task, subjects name patches of color ( C ) . 
Although there is no standard test with respect to materials, administration, or 
scoring, there is remarkable uniformity among the published findings. Without 
exception, investigators report that times are faster for W cards, somewhat 
slower for _C cards, and appreciably slower for CW cards. 

Differences between scores are affected by a host of variables, including 
practice, vividness and dissimilarity of colors, length of rows of items, age and 
sex of subjects, and several other factors discussed at length in a recent review 
(Jensen & Rohwer, 1966) . Included in the review are evaluations of the methods 
used for administering the test, as well as recommendations for an improved and 
standardized form of the test. These recommendations were taken into consideration 
in conducting the present experiment which employed the Stroop Color Word Test 
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to measure the extent of interference experienced by individuals, varying in 
degrees of familiarity with the French language, when they were required to give 
the English name of the ink color in which French color words were printed. 

The study was prompted by Lambert’s investigations (1967) in which balanced 
and non-balanced, compound and coordinate French-English bilinguals were required 
to name the ink color of French color words, English color words, and non-color 
words in both languages under a set for both speed and accuracy. Among o<-her 
results, Lambert reports that, with color words similar to those used in this 
experiment, interference is less when the language in which subjects name the 
colors differs from the language in which the color names are printed. Further, 
when the subjects are native speakers of English studying French, interference 
is less when the response language is English and the interfering language is 
French. Lambert’s series of experiments led to speculation that the degree of 
interference may be directly related to the level of familiarity with the inter- 
fering language. 

Accordingly, if subjects with varying experience with the French language 
are asked to name the colors in which a group of French color words are incon- 
gruously printed, we may expect that the subjects with no familiarity with French 
will perform the task fastest, subjects with limited familiarity with French 
will take longer, subjects with extensive familiarity with French will take even 
longer, and native speakers of French (albeit fluent in English) will require 
the longest time of all. The present study evaluates these expectations, hence 
the value of the bilingual Stroop test as an index of second language familiarity. 

Materials 

The three types of cards used were prepared with commercially available 
Carter’s Extra Service Felt Stamp Pads, and rubber stamps of elite size type. 

Since stamp pad inks did not contrast well with the black background recommended 
by Jensen and Rohwer for its "aesthetically more striking appearance" it was 
decided to use white-non-glossy card stock cut to 4 1/2' x 6" dimensions. All 
frequent English color words were considered for use in this study, but only 
four satisfied these two requirements; (a) the color is available as a stamp- 
pad ink color, contrasting well with a white background, and (b) the English 
color name has a phonemically and orthographically dissimilar French translation- 
equivalent. The four French-English pairs of color words selected were vert- 
green, rouge-red, noir-black, and mauve-purple. Evidently we were constrained 
to use several French words with some currency in English. 
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Each card type consisted of six rows with eight elements each. The words 
and colors used on the various cards were systematically varied to insure that 
all possible sequences of color words and inks occurred an equal number of times 
within the series, that no color word or ink was adjacent to itself in any of 
the rows or columns on any of the cards. The ^ cards consisted of 1/4 diameter 
dots made with an unblemished standard pencil eraser and the stamp 
pads described above. The format of a typical _C card is shown in Figure 1. 

The model in Figure 2 is representative of the ^ cards which were prepared using 
the stamp pads and rubber stamps. The W cards were identical with the GW cards in 
format except that English color words were used and were typed in black ink. (Figure 3) 

Insert Figures 1, 2, and 3 about here 



Procedure 

Sixteen subjects were selected from University of Michigan undergraduate 
and graduate students? four had no experience in French or any other Romance 
language; four were in the second semester of college French; four were senior 
French majors who had spent the previous year studying in a French university j 
and four were native speakers of French whose second language was English. All 
subjects were unpaid volunteers. 

The test was administered to each subject separately under nearly identical 
conditions. Each subject was seated at a desk in a room illuminated only by a 
Tensor lamp (model 5975, output 8.1 O.C.U. 29 V.A.). The lamp was focused on the 
various exposed cards. Each subject familiarized himself with an example of the 
test cards while the instructions quoted below were read to him. Any questions 
were answered by re~reading the instructions. The performance of each subject 
in each phase of the test was measured using a card he had not seen. The selection 
of the particular card with which a subject was tested was made randomly and 
independently except that in order to minimize possible sequential practice effects, 
all the cards presented to any one subject contained different response sequences. 
The order of card presentation was W, C, W and the following instructions 

were read in the order given. (Instructions for first W card) 

"During the first part of the experiment, you are to read aloud 
a paragraph like this one containing a series of four words: red, 

black, purple, and green. Read as rapidly as you can, since you 
will be timed. At the same time, do not make any mistakes as they 
will counl against you. Do you have any questions?" 
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(Instructions for the _C card) 

"Now I am going to show you a card like this one on which there 
is a series of colored dots. I want you to state aloud the color 
of the dots as quickly as you possibly can without making any mis- 
takes. The dots are either red, green, black or purple. Do you 
have any questions?" 

(Instructions for the ^ card) 

"Next I am going to show you a card similar to this one, on 
which there is a series of four French words printed many times 
in either red, purple, green, or black. As you can see, the words 
are vert, rouge, mauve, and noir. You are to name the color of 
the ink in which each word is printed. If the French word is printed 
in green ink, say ’green’, in purple ink, say ’purpld’, etc. Remember 
to work quickly as possible while avoiding mistakes. Are there any questions?" 

(Instructions for the second W card) 

"Now you will repeat the first part of the experiment by read- 
ing out loud a paragraph of English color words. Once again, you 
will be timed and mistakes will count against you. Do you have 
any questions?" 



Results 

The performance time in seconds for each subject in individual tasks, together 
with the group means and standard deviations are presented in Table 1. The mean 

Insert Table 1 about here 

performance times for the repeated tasks (W and CW cards) are indicated in coluruns 
X and X . Overall means by group by task are shown in Figure 4. Mean W card 

W CW 

Insert Figure 4 about here 

time seems fairly uniform for all groups, mean jC card time for Group IV is somewhat 
higher than for the other groups, and mean CW card time rises as a result of 
the degree of familiarity with the French language. In addition, the group rank 
of each subject for each test shown in Table 2 indicates a general consistency 
of performance across all tasks. 

Insert Table 2 about here 



A two-way analysis of variance was performed for groups and tasks with the 
latter highly significant (F = 77, df = 4, 48, p « .01). No scores violated 
the general observation of this and earlier studies that performance time increases 
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from performance on W cards, through _C cards to cards. The effec’* of groups 
is significant (F = 4.0, df = 3, 12, p << .05) as is the interaction (F = 3.6, 
df = 58, 16, p << .01). Most of the intergroup difference found seems to lie 
among the scores received on the _C cards and ^ cards. The scores on all three 
tasks were analyzed with the following results. 

No statistically significant difference exists among group times on W cards, 
either during or between the first or second exposure to them; however, the scores 
for Group IV are slightly higher on the average than those for the other groups. 

It is interesting to note that both Groups III and IV exhibit increased performance 
times during the second exposure to W cards , (W^) which took place after the ^ 
and GW cards. 

There is a statistically significant difference between the £ card mean for 
Group IV and the £ card mean for the other three groups combined. There is an 

[p(|t^^J.> 6.15) < i 2.98) = .01 = a] 

analogous difference between these groups with respect to card performance times. 

[p(|t 3 oJ > 4.53) < pdtjgl > 2.75) = .01 = a] 

Jensen (1965) evaluated the 11 formulas for deriving scores from the Stroop 
test that have been reported in the more than 60 published psychological studies 
in order to determine their reliability. A large group (N = 386) of undergraduate 
students in an introductory educac.ional psychology course (University of California) 
was tested on two occasions and another group (N = 50) was tested on 10 occasions, 
separated by at least one day. All the data were scored with each of the formulas, 
and all of the derived scores were intercorrelated and subjected to multivariate 
analysis. The conclusion of the study was that only three factors account for 
all variances among the derived Stroop scores. Factor I, represented by the for- 
mula C/C + W, is a measure of individual differences in the degree of difficulty 
subjects have in naminp colors with the "speed" factor, i.e., speed of reading 
words partialed out. Factor II, whose formula is CW-C, is the interference factor 
indicating the increment of difficulty of color-naming introduced by the conflicting 
color words. Factor III is clearly represented by the raw time score on the W 
card and represents the reading speed factor. 

These three factors were applied to the scores collected in this experiment 
and analyzed for significance. The W card scores (Factor III) are not signifi- 
cantly different among the four groups. The group scores on Factor II are pre- 
sented in Table 3 together with means, variances, and standard deviations. 
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Insert Table 3 about here 



These data show a sharp division between the scores for Group I and the scores 
for the other three groups which is statistically significant. 

[pdtjQ I > 3.01) < pdtjg I > 2.98 = .01 = a] 

Table 4 shows the group scores derived for the computation of Factor I, the 
color-naming factor. The differences between the mean score for Group IV and the 
mean score for the other three groups is also significant. 

I i 5.48 > p(|t^^| > 2.98) = .01 = a] 

Insert Table 4 about here 



Discussion 

On the basis of previous studies that investigated the differences in the 
magnitude of the interference effect when the words printed in varying colors 
were nonsense syllables and non-color words, it was anticipated that the overall 
effect of presenting French color words to subjects with varying amounts of ex- 
perience with the French language would be large enough to override the effect 
of differences among subjects with respect to age, reading speed, and color naming 
speed. In fact, variations in the performance time on the several tasks were so 
great that statistically significant differences among all the groups with respect 
to ^ card performance times were impossible to obtain, although there is a sig- 
nificant difference between the mean ^ performance time of Group IV, native speakers 
of French and the mean for the other three groups. Probably matching on the vari- 
ables mentioned would reduce substantially the intragroup variance. 

Although the French subjects probably had less experience with reading English 
aloud than the subjects of the other three groups, it was expected that all groups 
would be fairly uniform with respect to performance on the W cards. Table 1, 
however, shows a curious split among the mean performance times for the W cards, 
which separates these times into two groups — an "American" group consisting of 
Groups I and II and Group III before its exposure to the Fre ich conflict cards, 
and a "French" group consisting of Group III after exposure to French and Group 
IV. It is interesting to speculate whether the sudden plunge into a situation 
which tends to provoke thinking in French has the effect of inhibiting performance 
in English for those subjects with the greatest French language experience. 

er|c 
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For Groups I and II, the CW cards may well be more or less analogous to the 
nonsense syllable and/or non-color word cards used in earlier studies (Klein, 

1964; Lambert & Preston, 1967; Schiller, 1966). Such cards were the source of 
much less conflict than standard CW cards as measured by Factor II. If experi- 
encing conflict has any effect on subsequent performance on W cards, it is more 
reasonable to assume that Groups III and IV would demonstrate this than Groups I 
and II for whom French words may be akin to nonsense syllables. 

It was anticipated that the C card task would pose a slightly more difficult 
problem for Group IV than for the other three groups. Either of two processes 
may be taking place: (a) the color patch may elicit the mediating response of 
a French color word for the compound bilinguals of Group IV which then has to 
be translated into English, or (b) the subjects who were strong coordinate bi- 
linguals may simply have more difficulty bringing the English word to mind for 
the color-patch stimulus. Either of these two situations, a combination of them 
both, or other similar processes could explain the relatively slower performance 
of Group IV on this task. As no attempt was made to ascertain the degree to 
which these subjects were compound or coordinate bilinguals (Lambert & Preston, 
1967), no resolution of this anomaly is immediately obvious. Although the per- 
formance time variance of Group IV on this task was large, difference between 
the means of native speakers of French and the native speaking of English was 



significant. 

The differences among group ^ card times are also large and in the predicted 
direction for both exposures to these cards. Again, intragroup variance and the 
small populations make it unlikely that every group will be significantly distinct 

when raw scores alone are statistically analyzed. 

The analysis of the transformed scores making up Factors I and II reveals 
a significant difference between the American subjects and the French subjects 
in color-naming scores (Factor I). Furthermore, an inspection of the means and 
standard deviations of all the groups with respect to the Factor II scores of 
Table 3 indicates that all subjects with some knowledge of French experience 
increasing amounts of conflict between French color words and the ink color m 
which they are printed. The evidence for this conflict lies in the increasing 
CW card performance times and group variances as a result of extent of French 
language experience. The difference between those subjects who have been exposed 
to French and Group I manifests itself as much in the changing magnitude of the 
mean performance time as in group homogeneity. 
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Finally, it should be noted that the combination of response and interference 
language selected for this experiment (English and French respectively) occasioned 
the smallest amount of interference in the study reported by Lambert and Preston 
(1967). The fact that the differences across groups in CW card performance times 
are large and consistent indicates that in all likelihood, an expanded, more sophis- 
ticated version of this test w'Lh subjects matched on age, reading speed, and 
color-patch naming speed, would yield statistically significant ^ card scores. 
Conceivably it could also have important pedagogical implications, particularly 
in the measurement of the extent to which tudents have truly acquired foreign 
language habits. 



Summary 

The results of studies in which the Stroop Color Word Test has been used with 
nonsense syllables and non-color words printed in varying ink colors has led to 
speculation that French color words presented to subjects with varying amounts 
of French language background would produce similar large-scale effects. Admin- 
istration of this variant of the test produced results in the predicted direction, 
but failure to matcn subjects on the relevant variables of age, reading speed, and 
color-patch-naming speed made the significance of the results tenuous, except that 
native speakers of French are significantly slower in the two phases of the test 
in which color patches were named in English and the ink color of French color 
words were named in English. It is hypothesized that matched groups of subjects 
will exhibit significant differences in performance time on the French color word 
conflict cards and that the consistency of these differences will be useful 
pedagogically. 
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Figure Captions 

Fig. 1. jC card format. The C cards consist of an 8 x 6 matrix of colored 
dots approximately 1/4” in diameter of either red, purple, green, or black stamp 

pad ink against a flat white background. 

2. CW card format. These cards consist of an 8x6 matrix of French 

color words (rouge, noir, mauye, and yert) prepared on a flat white background 
with standard stamp pad inks corresponding to the color words, but with each 
word printed in a conflicting color. 

Fig. 3. W card format. These cards are similar to the cards except that 
all color words are typed in black. 

Fig. 4. Group means in seconds for each of the tasks performed in a bilingual 
Stroop Color Card Test. 
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Table 1 

Perforaance Times in the Three Phases of a 
Bilingual S troop Test 
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W, 



W, 



X 



CW, 



CW, 



X 



w 



cw 




Individual performance les to the nearest .1 second by task for each 
subject. Subscripts indicate presentation order of tasks. W = English 
color word reading, X^ = Means for word tasks, C = Color patch naming task, 
CW = ink color naming task for French color words, = Mean for CW task. 
Task means, group means and standard deviations are presented. 
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Table 2 

Rank Order by Task for the Three 
Phases of a Bilingual Stroop Test 



11 




Rank order of subjects with respect to performance time 
on the three tasks of a bilingual Stroop Color Word Test. 
Subscripts indicate presentation order of tasks, W = 
English color word reading, C = Color patch naming task, 
CW = ink color naming task for French color words. 
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Table 3 

Bilingual Stroop Test 
Interference Factor Scores (CW-C) , 
Group Means, and Standard Deviations 

Group Means , Variances , 
and Standard Deviations 



Group I 

Mo French 
•'Experience 

SD 


8.9 

6.0 

2.8 
2.6 

7.5 

5.5 

1.1 

4.8 


= 4.9 
SD^ =6.1 
SD^ = 2.5 


Group II 

One Year 
of French 

X2 

SD2 


4.4 

3.0 

14.8 

13.3 
6.8 

13.8 

15.4 


X 2 = 10.6 
SD 2 =22.2 

SD2 = 4.7 


Group III 

French 

Majors 

^3 

SD 3 


11.6 

11.0 

27.1 

20.1 
. 6 

2.6 

12.1 

17.1 


X^ = 12.8 
SD^ = 66.8 
SD^ = 8.2 


Group IV 


17.5 

19.5 


X. = 12.6 


Native 


6 • 6 


4 


French 


0 0 


2 




2.3 


sd; = 27.6 




12.3 


4 




14.3 




SD, 


13.9 


SD^ = 5.3 


4 


14.2 





^ 2 , 3.&4 = 
S®2.3.&4 = ^-3 
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...e difference between performance times to the nearest .1 sec. 
in two phases of the Stroop Color Word Test for four groups dif- 
fering in French language experience. (CW = ink color naming of 
French color words, C = color patch naming). The mean performance 
times X variences SD^ and standard deviations SD are indicated for 
each group and for Groups II, III, and IV combined. 




Table 4 

Bilingual Stroop Test Color-Naming Factor I Scores (C/CW) , 
Group Means'^. Variances, and Standard Deviations. 



Group I 

No French 
Experience 

SD^ 


.56 

L = .58 

.61 \ 

SD^ = .00042 

.58 ^ 

SD = .020 

.56 


Group II 


.58 




X„ = .56 


One Year 


.55 


of French 


SD^ = .00015 


Xo 


.55 ^ 


2 


SD„ = .012 


3^2 


.56 


Group III 


.53 


French 


^3 = .55 


Majors 


.57 2 




SD. = .00082 


^3 


.59 ^ 




SD. = .029 


SD 3 


.52 3 


Group Iv 


.68 


Native 


X, - .64 

^ ^ 4 ” 


French 


.65 . 




SD, = .00052 




.63 




SD, = .023 


SD, 

4 


.62 * 



The ratio between performance time in the 
color patch naming task (C) and the ink color 
naming of French color words task of a bi- 
lingual Stroop Color- Word Test (C/CW), 
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Figure 1 



vert rouge noir mauve noir vert mauve rouge 
mauve vert rouge noir mauve rouge vert noir 
rouge noir mauve vert rouge mauve noir vert 
noir mauve vert rouge vert noir mauve rouge 
mauve noir rouge vert mauve rouge noir vent 
noir rouge vert mauve noir vert rouge mauve 



Figure 2 
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purple green red black green black red purple 
black red green purple red purple black green 
green purple black red purple red green black 
red black purple green black green red purple 
green red black purple green black purple red 
purple black red green red purple black green 



Figure 3 
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_J ^ ^ ^ L_ 

W, W5 C2 CW3 cw. 



English Color Words Color Potches French Color Words 

in Incongruent 

Color Inks 

Figure 4 
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